A Hybrid Bayesian Network Model for Predicting Breast Cancer Prognosis

Authors

  • Jong Pill Choi
  • Tae Hwa Han
  • Rae Woong Park
Abstract

Objective: Breast cancer is one of the most common cancers affecting women, and both physicians and patients are concerned about breast cancer survivability. Many researchers have studied breast cancer survivability using artificial neural network (ANN) models. ANN models have usually outperformed other models, such as logistic regression, Bayesian network (BN), and decision tree models, in classifying breast cancer survivability. However, physicians in the field hesitate to use ANN models because an ANN is a black box, which makes it hard to explain a classification result to patients. In this study, we propose a hybrid model that offers both accuracy and interpretability by combining an ANN for accuracy with a BN for interpretation.

Methods: We developed an artificial neural network, a Bayesian network, and a hybrid Bayesian network model to predict breast cancer prognosis. The hybrid model combined the artificial neural network and the Bayesian network to obtain a good estimate of prognosis as well as a good explanation of the results. The National Cancer Institute's SEER program public-use data (1973-2003) were used to construct and evaluate the proposed models. Nine clinically acceptable variables were selected as input nodes of the proposed models. A confidence value from the neural network served as an additional input node to the hybrid Bayesian network model. Ten iterations of random subsampling were performed to evaluate the performance of the models.

Results: The hybrid BN model achieved the highest area under the curve (AUC) value, 0.935, whereas the corresponding values for the neural network and the Bayesian network were 0.930 and 0.813, respectively. The neural network model achieved the highest prediction accuracy, 88.8%, with a sensitivity of 93.7% and a specificity of 85.4%. The hybrid Bayesian network model achieved a prediction accuracy of 87.2%, with a sensitivity of 93.3% and a specificity of 83.1%. The results of the hybrid Bayesian network model were very similar to those of the neural network model.

Conclusion: In the experiments, the hybrid model and the ANN model outperformed the Bayesian network model. The proposed hybrid BN model for breast cancer prognosis prediction may be useful for clinicians, as it provides both the high performance inherited from the ANN and the explanatory power of the BN. (Journal of Korean Society of Medical Informatics 15-1, 49-57, 2009)
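
The hybrid idea described in the Methods, feeding the neural network's confidence value into the Bayesian network as one more input node, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the abstract does not give the network structure, so a discrete naive Bayes classifier (the simplest Bayesian network, with the class as the only parent of every feature) stands in for the BN; the confidence values are made-up numbers rather than outputs of a trained ANN; the binning into four states and the two toy features are hypothetical; and none of the data comes from SEER.

```python
import math
from collections import Counter, defaultdict


def bin_confidence(p, n_bins=4):
    """Discretize a confidence in [0, 1] into one of n_bins states (assumed binning)."""
    return min(int(p * n_bins), n_bins - 1)


def add_confidence_node(rows, confidences, n_bins=4):
    """Append the binned ANN confidence to each row as an extra discrete feature."""
    return [tuple(row) + (bin_confidence(p, n_bins),)
            for row, p in zip(rows, confidences)]


class NaiveBayes:
    """Discrete naive Bayes with Laplace smoothing; a stand-in for the paper's BN."""

    def fit(self, rows, labels, alpha=1.0):
        self.alpha = alpha
        self.class_counts = Counter(labels)
        self.feature_counts = defaultdict(Counter)  # (feature index, class) -> value counts
        self.values = defaultdict(set)              # feature index -> observed values
        for row, y in zip(rows, labels):
            for j, v in enumerate(row):
                self.feature_counts[(j, y)][v] += 1
                self.values[j].add(v)
        return self

    def predict_proba(self, row):
        """Return {class: posterior probability} for one row."""
        total = sum(self.class_counts.values())
        log_post = {}
        for y, n_y in self.class_counts.items():
            lp = math.log(n_y / total)
            for j, v in enumerate(row):
                num = self.feature_counts[(j, y)][v] + self.alpha
                den = n_y + self.alpha * len(self.values[j])
                lp += math.log(num / den)
            log_post[y] = lp
        # Normalize in log space for numerical stability.
        m = max(log_post.values())
        z = sum(math.exp(lp - m) for lp in log_post.values())
        return {y: math.exp(lp - m) / z for y, lp in log_post.items()}


# Toy data (hypothetical): two discrete features, label 1 = survived.
rows = [("small", "neg"), ("small", "pos"), ("large", "neg"),
        ("large", "pos"), ("small", "neg"), ("large", "pos")]
labels = [1, 1, 0, 0, 1, 0]
# Made-up ANN confidences for "survived"; a real pipeline would take these
# from a trained neural network evaluated on the same patients.
ann_confidence = [0.9, 0.8, 0.3, 0.1, 0.95, 0.2]

hybrid_rows = add_confidence_node(rows, ann_confidence)
hybrid = NaiveBayes().fit(hybrid_rows, labels)

p_high = hybrid.predict_proba(("small", "neg", bin_confidence(0.9)))
p_low = hybrid.predict_proba(("large", "pos", bin_confidence(0.1)))
print("P(survived | favorable features, high ANN confidence):", round(p_high[1], 3))
print("P(survived | unfavorable features, low ANN confidence):", round(p_low[1], 3))
```

Because the confidence enters as an ordinary node, its conditional probability table can be inspected alongside those of the clinical variables, which is the kind of interpretability the abstract attributes to the BN side of the hybrid.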

Similar resources

A Probabilistic Bayesian Classifier Approach for Breast Cancer Diagnosis and Prognosis

Basically, medical diagnosis problems are the most effective component of treatment policies. Recently, significant advances have been formed in medical diagnosis fields using data mining techniques. Data mining or Knowledge Discovery is searching large databases to discover patterns and evaluate the probability of next occurrences. In this paper, Bayesian Classifier is used as a Non-linear dat...

Extracting Predictor Variables to Construct Breast Cancer Survivability Model with Class Imbalance Problem

Application of data mining methods as a decision support system has a great benefit to predict survival of new patients. It also has a great potential for health researchers to investigate the relationship between risk factors and cancer survival. But due to the imbalanced nature of datasets associated with breast cancer survival, the accuracy of survival prognosis models is a challenging issue...

Investigating the Cause-and-Effect Relationships Among Variables Related to Breast Cancer Using Bayesian Networks

Background and Objectives: Breast cancer is the most common cancer in Iran. It can be prevented by rapid diagnosis of the disease. Thus, it is necessary to determine the causal relationships between variables related to breast cancer. Bayesian network is a data mining tool that shows the causal relationship between different variables. In this paper, a Bayesian network was applied to find causa...

Comparison of logistic regression and neural network models in predicting the outcome of biopsy in breast cancer from MRI findings

Background: We designed an algorithmic model based on the logistic regression analysis and a non-algorithmic model based on the Artificial Neural Network (ANN). Materials and methods: The ability of these models was compared together in clinical application to differentiate malignant from benign breast tumors in a study group of 161 patients' records. Each patient’s record consisted of 6 subjec...

Publication date: 2009